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Abstract — We consider the design of multiple-input multiple- 
output communication systems with a linear precoder at the 
transmitter, zero-forcing decision feedback equalization (ZF- 
DFE) at the receiver, and a low-rate feedback channel that 
enables communication from the receiver to the transmitter. 
The channel state information (CSI) available at the receiver is 
assumed to be perfect, and based on this information the receiver 
selects a suitable precoder from a codebook and feeds back the 
index of this precoder to the transmitter. Our approach to the 
design of the components of this limited feedback scheme is based 
on the development, herein, of a unified framework for the joint 
design of the precoder and the ZF-DFE under the assumption 
that perfect CSI is available at both the transmitter and the 
receiver. The framework is general and embraces a wide range 
of design criteria. This framework enables us to characterize 
the statistical distribution of the optimal precoder in a standard 
Rayleigh fading environment. Using this distribution, we show 
that codebooks constructed from Grassmann packings minimize 
an upper bound on an average distortion measure, and hence are 
natural candidates for the codebook in limited feedback systems. 
Our simulation studies show that the proposed limited feedback 
scheme can provide significantly better performance at a lower 
feedback rate than existing schemes in which the detection order 
is fed back to the transmitter. 

Index Terms — Limited feedback, Decision feedback equaliza- 
tion (DFE), Grassmann packings, Majorization, Schur-convexity. 



I. Introduction 

Multiple-input multiple-output (MIMO) communication 
schemes offer the potential for significant increases in spectral 
efficiency over their single-input single-output counterparts 
by enabling simultaneous transmission of independent data 
streams. MIMO schemes also offer the potential for sig- 
nificant performance gains in a variety of other metrics. 
Standard transceiver architectures for these schemes include 
linear precoding and equalization, and the combination of 
linear precoding and decision feedback equalization (DFE), 
which offers the potential for improved performance over the 
linear approach while maintaining comparable complexity. For 
scenarios in which accurate channel state information (CSI) is 
available at both the transmitter and the receiver, there is a 
well established framework that unifies the design of linear 
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transceivers under many design criteria [1]. A counterpart 
for the design of systems with DFE has recently emerged 
[2]-[5]. This framework was also extended to MIMO sys- 
tems with pre-interference subtraction at the transmitter in 
[2]. However, in many scenarios, such as frequency division 
duplex systems, obtaining accurate CSI at the transmitter may 
require a considerable amount of feedback to the transmitter. 
An approach that allows the designer to limit the required 
amount of the feedback is to quantize the transmitter design. 
In these limited feedback schemes [6], the receiver uses its 
CSI to choose the best transmitter design from a codebook 
of available designs, and then feeds back the index of this 
precoder to the transmitter. This strategy has been considered 
for beamforming schemes (e.g., [7]— [13]), unitary precoding 
with linear equalization (e.g., [14]). and unitary precoding for 
orthogonal space time block codes [15], [16]. For zero-forcing 
DFE schemes, a limited feedback scheme in which the receiver 
feeds back the order of interference cancellation was proposed 
in [17], [18]. 

In this work, we consider the design of a limited feedback 
scheme for systems with a (general) linear precoder at the 
transmitter and zero-forcing DFE at the receiver. Our designs 
are based on a unified framework, developed herein, for the 
joint design of the precoder and the ZF-DFE in the presence of 
perfect CSI. This framework embraces a wide range of design 
criteria that can be expressed as functions of the mean square 
error (MSE) of each data stream, including minimization of 
the total MSE, minimization of the average bit error rate 
(BER), and maximization of the Gaussian mutual information. 
In particular, we show that the optimal precoder for systems 
with a zero-forcing DFE is the same for all these criteria; 
a property that cannot be achieved by a linear transceiver. 
Furthermore, we show that the optimal precoder for these 
objectives is a scaled unitary matrix that is isotropically 
distributed (over the Stiefel manifold of unitary matrices). 
Using this distribution, we show that codebooks constructed 
from Grassmann subspace packings minimize an upper bound 
on an average distortion measure, and hence are excellent 
candidates for the codebook in limited feedback schemes for 
systems with zero-forcing DFE. In contrast, the application of 
Grassmann codebooks in limited feedback schemes with linear 
receivers (e.g., [14]) involves an inherent compromise, because 
the optimal precoder in the presence of perfect CSI and a 
total power constraint is not unitary. Since the scheme that we 
propose involves the construction of codebooks for isotropi- 
cally distributed unitary matrices, our scheme subsumes that 
in [17], [18], in which the precoder is, by construction, a 
permutation matrix. Our simulation studies suggest that the 
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additional degrees of freedom available in our approach enable 
our scheme to provide significantly better performance than 
that in [17], [18] while using a lower feedback rate. 

Our notation is as follows: Boldface type is used to denote 
matrices and vectors; denotes the i th element of the vector 
a, Ajj denotes the element at the intersection of the i th row 
and j th column of the matrix A, A H denotes the conjugate 
transpose of A, and (A)t denotes the (minimum norm) 
pseudo-inverse of A. The terms det(A), and |jA||2 denote 
the determinant and the two-norm (maximum singular value) 
of A, respectively. The notation Diag(x) denotes the diagonal 
matrix whose elements are the elements of x. 

II. System Model 

We consider a point-to-point communication system with 
N t transmit antennas and N r receive antennas that transmits 
K data streams simultaneously, where K is no greater than 
the rank of the channel matrix H. We adopt a narrow 
band block fading channel model, and we consider MIMO 
communications systems that use (generalized) zero-forcing 
decision feedback equalization, e.g., [19], [20], for spatial 
equalization. At the transmitter, the input data vector s g C A 
is linearly precoded using P to generate the transmitted data 
vector x € C^*, 

x = Ps. (1) 

Without loss of generality, we will assume that Ejss^} = I, 
and hence the total transmitter power constraint can be written 
as E{x H x} = tr(P H P) < P total . 

The vector of received signals is given by 



HPs + n. 



(2) 



where H is the channel matrix and n is the vector of additive 
noise which is assumed to have zero-mean and a covariance 
matrix E{nn H } = er^I. As illustrated in Fig. 1, following 
linear processing using the feedforward matrix G, the receiver 
makes successive decisions on each symbol by subtracting the 
effect of previously decided symbols. Hence, the feedback 
matrix B is strictly lower triangular. This system model 
embraces linear precoding and equalization as a special case 
when B = 0. Assuming correct previous decisions, the vector 
of inputs to the quantizer is given by 



s = (GHP B)s + Gn. 



(3) 



By defining the error signal e = s — s, the error covariance 
matrix (the "MSE" matrix) can be written as 

E = E{ee ff } = CC ff - CP H H H G ff - GHPC ff 

+ GHPP ff H ff G H + a 2 n GG H , (4) 

where C = I + B is a unit diagonal lower triangular matrix. 

We will consider communication schemes in which perfect 
CSI is available only at the receiver. Based on its channel 
knowledge, the receiver selects a suitable precoding matrix 
from a codebook of precoders V of size \P\, and feeds that 
index back to the transmitter using log 2 \P\ information bits; 
see Fig 1 . In order to develop effective methods for quantizing 
the precoding matrix, we first need to characterize the optimal 
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Fig. 1. MIMO transceiver with DFE using limited feedback. 

precoding matrix for different design criteria in the presence 
of perfect CSI. We will then use the statistical distribution of 
this optimal precoder to define the distortion measures that 
are required to design the codebook for the limited feedback 
scheme. 

III. Unified Framework for Zero-Forcing DFE 

In this section, we develop a general framework for the 
joint design of the transceiver matrices G, C = I + B, and 
P in the presence of perfect CSI. We consider system design 
criteria that are expressed as functions of the (logarithm of the) 
MSE of the individual data streams The proposed frame- 
work embraces a wide range of design objectives. It includes 
objectives for which optimal designs are already available 
(e.g., the total MSE, [20]), and several other objectives for 
which the optimal transceiver design has remained an open 
problem. The framework can be regarded as a counterpart 
for the existing framework of linear transceiver design [1]. 
Here, the framework is derived for DFEs with a zero-forcing 
constraint, but an analogous framework can be developed in 
the absence of this constraint [2]-[5]. 

A. ZF-DFE Receiver Design 

The zero-forcing design criterion implies 

GHP - B = I. (5) 

Given the assumption that K < rank(H), the condition in (5) 
can be achieved so long as P is chosen such that rank(HP) = 
K. In that case, the feedforward matrix G is given by 



G = C(HP)t. 



(6) 



Since HP has full column rank, the pseudo-inverse in (6) can 
be written as 



(HP) T = (P H H fl HP) 1 P ff H fl 



(7) 



Using the expression for G in (6), the MSE matrix in (4) 
reduces to 

E = CNC ff , (8) 

where N = er^ (P H H ff HP) _1 is a positive definite Hermi- 
tian matrix. The optimal matrix C, that minimizes the MSE 
of each individual data stream, subject to being unit diagonal 
and lower triangular, is given by [2] 



C = Diag(Ln, . . . ,Lkk) L \ 



(9) 
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where N = LL ff is the Cholesky factorization of N, and L is 
a lower triangular matrix with strictly positive diagonal entries. 
Using this optimal C, the MSE matrix can be rewritten as 



E = Diag (L? 1; ...,L 2 ^), 



(10) 



where L,, is the i th diagonal element of L. Hence, the SNR 
of each data stream is 



Definition 2 (Schur-convex and Schur-concave functions): 
A real-valued function /(x) defined on a subset A of M. K is 
said to be Schur-convex if 

a^bon.4^/(a)</(b), 

and is said to be Schur-concave if 

a -< bon.4 => /(a) > /(b). 



(16) 



SNR fe 



1 



1 



(11) 



B. Transmitter Design 

Given the optimal G and C, our next step is to design a 
precoding matrix P so as to optimize design criteria that are 
expressed as functions of the (logarithm of the) MSE of each 
individual stream. To derive the optimal precoding matrix, we 
will first obtain some inequalities that involve the logarithm 
of the MSE of the individual data streams, 



i = (lnL? 1) . 



(12) 



using concepts from majorization theory. 

Definition 1 (Additive Majorization [21 ]): Let a, b £ 
and let am, . . . , a,[ K ] denote the re-ordering of the elements 
of a in a non-increasing order; i.e., am > . . . > arjfi. The 
vector b is said to majorize a, a -< b, if 



(17) 

□ 

In particular, we will consider communication objectives that 
can be expressed as the minimization of increasing func- 
tions of the MSEs of each data stream, o(Lf 1; . . . ,\? KK ) = 
g(e l1 , . . • , c 1k ) = g(e l ), that are either Schur-convex or Schur- 
concave functions of the logarithm of the MSEs, I. 

Let H H H = UAhU h be the eigen value decomposition 
of H H H such that the entries of the diagonal matrix Ah are 
squared singular values of H, cr|(H), in descending order. 
Let Ui and Ahi be the first K columns of U and Ah, 
respectively. The optimal precoders for the above two classes 
of design criteria are given by the following theorem. 

Theorem 1: The optimal precoder for the class of objectives 
for which g(e l ) is a Schur-convex function of the logarithm 
of the MSEs is independent of the actual form of g(-) and is 
given by: 



Pt 



total 

IT 



UiV(A H i 



(18) 



j j where V(Ahi) is a unitary matrix that results in the QR 

a[j] < V] b[j] for j = 1, . . . , K — 1, (13) decomposition of A H ^ 2 V(A H i) = QR having an R factor 



i=l i=l 
K K 

i=l i=l 



(14) 



□ 



The following lemma will play a key role in our framework. 

Lemma 1: For the Cholesky factorization N = LL H , the 
following inequalities hold: 



Indet(N) 
K 



(!,...,!) < I < (lnAi(N), lnAtf(N)), 



where Afc(N) is the fc th largest eigen value of N. □ 
Proof: To prove the first inequality, we observe that any 



vector a £ K majorizes its mean vector a, whose elements 

K 



are all equal to the mean a; = -h 2~2%=i a «- That is, 



with equal diagonal elements. 
For the class of objectives for which g(e ) is a Schur-concave 
function of the logarithm of the MSEs, the optimal solution 
results in B = 0, and hence the optimal zero-forcing linear 
transceiver is an optimal transceiver for a system with a zero- 
forcing DFE. 

Proof: See the Appendix. ■ 
Algorithms for obtaining a matrix <I> such that the R-factor 
of the QR decomposition of A$ has equal diagonal elements 
were introduced in [24], [25], and V in (18) can be obtained 

— 1/2 

by applying the algorithms therein to the matrix A H1 . 

As illustrated by the following examples, the developed 
framework embraces a wide range of design criteria: 

• Minimization of the sum of the individual MSEs: In this 
case the objective is to minimize 



a -< a. 



(15) 



9(e l ) 



Since N = LL H , we have that l\k=i L tk = dct(LL H ) = 
det(N). Hence, the first inequality follows directly. The sec- 
ond inequality follows by applying Weyl's inequality [22] to 
the matrix L. ■ 
It is worth observing that the second inequality in Lemma 1 
holds with equality when L is normal [22]. Since L is a lower 
triangular matrix, in order to be normal it must be a diagonal 
matrix [23]. If L is diagonal, the matrix C will then be equal 
to I and decision feedback equalization will reduce to linear 
equalization. 

The proposed designs will be based on the following classes 
of functions [21]. 



K 

fe=i 



(19) 



Here, g{e l ) takes the form 2~2k=i h(lk) f° r me convex 
function h(lk) = e lk , and hence it is a Schur-convex 
function of I, [21]. 

Minimization of the maximum MSE / Maximization of 
minimum SNR: In this case the objective is to minimize 



g(e l ) = max(e ifc ), 

k 



(20) 



which is the composition of the increasing Schur-convex 
function max^ x^ with the increasing and convex func- 
tion e lk . Hence, g(e ) is also a Schur-convex function 
[21, pp. 63]. 
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• Minimization of the average Bit Error Rate: This corre- 
sponds to the minimization of the objective 

- K K 
g(e l ) = -^BER(SNR fc ) = ^BER(e- ifc ), (21) 

k=l k=l 

where the BER expression will depend on the con- 
stellation used, and we have assumed that the same 
constellation is used for each element of s in (1). It can be 
verified that under a mild constraint on the SNR, the BER 
expressions for BPSK and M-QAM constellations are 
convex functions of If.. Hence, g(e ) is a Schur-convex 
function of I. 

• Maximization of Gaussian mutual information This cor- 
responds to the minimization of 

K 

5 (e i )=^-log(l + e-'*), (22) 
fc=i 

which takes the form 2~2k=i M^fc) f° r & e convex function 
h(lk) = — log(l + e~ lk ), and hence it is a Schur-convex 
function of I. 

• Minimization of the product of MSEs: Minimization of 
the product of the individual MSEs (or equivalently, 
the geometric mean of the MSEs) corresponds to the 
minimization of 

K K 

g(e l ) = log He 1 * =^h, (23) 

fe=i k=i 

which is both Schur-convex and Schur-concave. Further- 
more, since 2~2k=i h = — >~2k=i log(SNR), at high SNR 
the minimization of the product of the MSEs corresponds 
to the maximization of the Gaussian mutual information. 
As demonstrated by Theorem 1 and the above examples, 
the optimal precoder for a system with zero-forcing DFE and 
a design objective from the Schur-convex class simultaneously 
optimizes the total MSE, the average bit error rate, and 
the Gaussian mutual information. MIMO systems with linear 
precoding and equalization do not achieve this simultaneous 
optimality, and in the general case each of these objectives 
results in a different optimal precoder [1]. For design criteria 
that can be expressed as the minimization of objectives that 
are both Schur-convex and Schur-concave, both the optimal 
Schur-convex design in (18) and the optimal linear transceiver 
will yield the same objective value. In the following sections, 
we will consider the efficient design of codebooks for limited 
feedback systems with Schur-convex objectives. Our first step 
will be to obtain the statistical distribution of the optimal 
precoder. 

IV. Statistical Distribution of Optimal Precoder 

FOR SCHUR-CONVEX OBJECTIVES 

The optimal precoder for the Schur-convex class of objec- 
tives can be written as 

P = ^/^P, (24) 

where the matrix P = Ui V(Ahi) belongs to the Stiefel man- 
ifold S(Nt, K) of complex N t x K matrices with orthonormal 



columns. The statistical distribution of P in (24) plays a key 
role in the design of the codebooks, and is established in 
Theorem 2 below. First, we establish an intermediate result. 

Lemma 2: For an i.i.d. Rayleigh fading channel matrix H, 
the matrices Ui and V(Ahi) are statistically independent. 
Furthermore, Ui is isotropically distributed over the manifold 
S(N U K). 

Proof: The proof follows directly from the isotropic 
distribution of the eigen vectors of the Wishart distributed 
matrix H^H and its independence of the eigen values. ■ 
Theorem 2: For an i.i.d. Rayleigh fading channel matrix 
H, the normalized optimal precoder matrix P is isotropically 
distributed over the Stiefel manifold S(N t , K). 

Proof: We first observe from Lemma 2 that Ui is isotrop- 
ically distributed over the manifold S(N t , K). Hence, its prob- 
ability distribution p(Ui) is unaffected by post-multiplication 
by any deterministic unitary matrix Z; i.e., p(Ui) = p(UiZ). 
Hence, 

p(P) = J p(P\V) p(V) dV (25) 

= J P (V 1 )p(V)dV = p(V 1 ), (26) 

Since Ui is isotropically distributed, then so is P. ■ 
It is worth noting that for MIMO systems with linear 
precoding and equalization, the optimal precoder will not be 
isotropically distributed. That is true for a wide range of 
objectives under a total power constraint (e.g., [1] and the 
references therein), and holds for both zero-forcing and MMSE 
linear receivers. That said, some quantization methods for 
linear transceivers have been based on a suboptimal underlying 
scheme that selects the best unitary precoding matrix; e.g., 
[14]. In that case the distribution of the unquantized precoder 
is isotropic. In the case of systems with a zero-forcing DFE, 
we have shown that selection of the best unitary precoding 
matrix is optimal. 

V. Precoder Selection and Codebook design 

In order to study the codebook design problem, we will first 
consider the selection method for choosing the best precoding 
matrix from a given codebook V . 

A. Precoding Matrix Selection 

Given a codebook for quantizing the normalized optimal 
precoding matrix P, V = {P"\ j = \V\}, and a cost 

function <?(■) associated with the design criterion, the receiver 
will select a normalized precoding matrix from the codebook 
that yields the minimum value for the cost function; i.e., the 
receiver will select the index 

are min o(e iJ ), (27) 

where I 3 is the vector containing the logarithm of the 
diagonal elements of V, the Cholesky factor of N J = 
<t^(-^-P J H H HP J ) . The quality of a given codebook 
can be measured in terms of the average degradation in the 
value of the objective that is incurred by using a precoder from 
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the codebook rather than the optimal precoder in Theorem 1 . 
Borrowing terminology from the source coding literature, we 
will refer to this degradation, and various bounds thereon, as 
distortion measures for the quantization scheme. 

B. Grassmann Packing and Codebook Design 

In the following section we will consider the design of 
codebooks to minimize distortion measures for the broad class 
of objectives g(e ) that are Schur-convex in I. As shown in the 
previous section, for these objectives the optimal normalized 
precoder is uniformly distributed over the Stiefel manifold 
S(N t ,K). We observe that the range of the columns of 
any normalized precoding matrix P represents a K dimen- 
sional subspace, i?p-, of C^*. Hence, the desired codebook 
V = {P J ,j = l,...,!? 3 !} represents a set of subspaces 
1Z = {i?pj,j = l,...,!? 3 !}, and each of these subspaces 
can be represented as a point in the associated quotient space, 
namely the Grassmann Manifold; e.g., [26], [27]. In the next 
section, we will relate the problem of designing codebooks 
that minimize suitable distortion measures to the Grassmann 
packing problem that selects a set of subspaces such that the 
minimum pairwise distance between any two subspaces in the 
packing is maximized. The distances between two subspaces 
i?pi and Rp2 can be defined in different ways [28]. For 
example, the projection 2-norm is defined as 



distp ro j 2 (P\p 2 ) 



p'p 1 " -P 2 P 2H 



(28) 



while the Fubini-Study distance is defined as 



dist FS (P\ P 2 ) = arccos det(P 1 H P 2 ) . (29) 

For a given set or a packing of subspaces and a given distance 
measure, we will denote the minimum pairwise distance 
between any two subspaces in the packing by 



Minimization of the sum of MSE: 

K 

g(e l ) = ^e lk <K maxe 1 ' 



K 



fc=i 



k min/c e lk 



(31) 



Minimization of the maximum MSE / Maximization of 
minimum SNR: 



g(e l ) = max(e' fc ) = 



1 



k mirifc e 

Minimization of the average Bit Error Rate: 

K 

g(e l ) = ^BER(e^ fc ) < K BER(mine" 

k=l 



(32) 



(33) 



We observe that each of these bounds is expressed in terms 
of the minimum SNR over the K data streams, SNR m ; n = 
min/j e~ lk . 

Since each of these terms is bounded by the minimum 
SNR, a natural choice for the distortion measure for a given 
codebook is the average loss in the minimum SNR that one 
incurs by using a normalized precoder p quan chosen from the 
codebook V instead of using the optimal normalized precoder 
P° pt . That is, 

£ = E H | SNR min ( P° pt ) - SNR^P^"')} 
_ E H { VdetAm} 



E H { i 



max mm e 

<j<\V\ l<k<K 



<}, (34) 



where (34) follows by observing that the optimal P results in 
l k = lnd ^( N ) f or ever y ^ Consider the second term in the 
distortion measure in equation (34). From the definition of the 
majorization relation a ~< b, we have apj < b^j. Hence, 
from Lemma 1 we have 



min dist(P"', P ). 

l<i<j<\T\ 



(30) 



In addition to the minimum distance of the packing d, we 
will also be interested in its density D; e.g., [28]. In our 
context, the density is the probability that the range space of an 
isotropically distributed unitary matrix falls within a distance 
d/2 of any of the subspaces of the packing, and is function 
of d, \P\ and the volume of the manifold; see [28]. In the 
following two sections, we will show that codebooks from 
certain optimized Grassmann packings minimize distortion 
measures that are appropriate for two subclasses of the Schur- 
convex objectives: the strict Schur-convex objectives, and 
the objectives that are both Schur-convex and Schur-concave 
functions of I. 



max lk < In Ai (N) = In 



l<k<K 

from which it follows that 



,(HP! 



(35) 



Eh - ! max mm e 

i<i<l"P| i<k<K 



f > Eh | max 

J L 1 < j < 1 7=" | 



<in(HP 3 )' 



(36) 



Hence, the distortion measure in (34) is upper bounded by 

4( Hpi )' 



Eh{ VdetAm} 



C„ 1-1<7<|-P| 



3h{ 



■}■ (37) 



When codebooks are designed from a Grassmann packing 
using the projection 2-norm distance in (28), the expectation 
on the right hand side of (36) satisfies [14], 



C. Codebook Designs for Strictly Schur-convex Objectives Eh - ! max er^ lin (HP-') > > En{cr|f (H)}Z? pro j2( 1 

1 1< 7 < I V I J V 



In this section we will present suitable distortion measures 
for objectives g(e ) that are Schur-convex functions of I 
and are not Schur-concave; e.g., the sum of the MSEs, the 
maximum MSE and the BER. From the first principles, we 
can obtain the following bounds on the these objectives: 



proj2 



(38) 

where d pro j2 is the minimum pairwise distance of the packing 
(cf. (30)) for the projection 2-norm distance, and D pr0 j2 is 
the corresponding packing density; cf. [28]. In addition, for a 
given \V\ the right hand side of (38) is an increasing function 



6 



of the packing distance d pro j2- Using the inequality in (37), we 
obtain the following upper bound on the distortion: 



c < E H { VdHtA^} E H {^(H)} / 

£ " ^ ^ ^2^1- — 

(39) 

which, for a given \V\, is a decreasing function of the packing 
distance dp ro j2. The bound on the right hand side of (39) 
can be easily manipulated by choosing the codebook from a 
Grassmann packing that is designed to maximize the packing 
distance d in (30) with projection 2-norm as the distance 
metric. Such designs correspond to minimizing the bound on 
the distortion. 

Since permutation matrices are special cases of unitary 
matrices, the limited feedback approach in [17], [18], in 
which the precoder is chosen from a codebook of permutation 
matrices, is a special case of our proposed design. However, 
the resulting codebooks do not necessarily have the maximum 
packing distance. Furthermore, the size of the codebook in the 
approaches in [17], [18] is fixed for a given N t and K, while 
the Grassmann packings can be constructed for an arbitrary 
number of codewords. 



D. Codebook Designs for Objectives that are Both Schur- 
convex and Schur-concave 

For communication objectives g(e l ) that are both Schur- 
convex and Schur-concave functions of I, such as the 
minimization of product of the MSEs, we observe 
that the design problem corresponds to maximization of 
dct(P ff H if HP)/(T T 2 l . Hence, a suitable distortion measure for 
the codebook is 



(P° ptff H ff HP° pt ) 



□uant H 



=rquant 



)}/<* 



(40) 



£ = E H {det( 

- det(P 
= E H {detA m 

-E H ( max det(P J ff H H HP J )}/cr 2 
Vi<j<\v\ 

< E H {detA m }/^ 

-E H {dctAHi}E H { max det(P J ^UiUf P J )}/<r£. 

(41) 

Here, (41) follows from the independence of U and A. When 
codebooks are designed from a Grassmann packing using the 
Fubini-Study distance in (29), the last expectation on the right 
hand side of (41) satisfies the following inequality [14]: 



E 



h| max det(P"' ff Ui 



i<i<|-P| 



UfP j )} 



> D FS cos 2 (d FS /2). 



(42) 

Hence, we obtain the following upper bound on the distortion: 

£ < E H {dct A H i}(l - ^ FS cos 2 (d FS /2))/ ( T 2 , (43) 

which, for a given \V\, is a decreasing function of the 
packing distance dps- A similar upper bound was proposed for 
designing codebooks for MIMO systems with linear receivers 
[14]. 



E. Comparison with ZF-Linear Schemes 

In this section, we will show that for a given codebook, the 
performance of the zero-forcing DFE with limited feedback 
provides an upper bound on the performance of its linear 
zero-forcing counterpart for any Schur-convex performance 
objective g(e ). As stated in the following lemma, this is true 
for any codebook, including those codebooks constructed from 
non-unitary matrices. 

Lemma 3: Consider a codebook of precoding matrices, V, 
and a Schur-convex performance g(e l ). For any given channel 
H, let Zq FE denote the vector I in (12) when the precoder P J 
is used, and let the i^in denote the corresponding vector for 
the case of linear equalization. Then 

g(e 1 ™^) < 



mm 

=1,...,|-P| 



mm ale Lin . 



Proof: Consider a given channel H and any precoding 
matrix P J 6 V . For the linear zero-forcing receiver we 
have C = I. It follows from (9) that the corresponding 
matrix N J and its Cholesky factor \J are diagonal. Hence, 
(Mi) 2 = M NJ )> or > equivalently, 



V 



{\n\ l {W) 1 ...MX K {W)). 
On the other hand, for the DFE receiver we have 
^^(ln^) 2 ,...,^^) 2 ). 

From Lemma 1, we have l J Dm -< l J Un , hence g(e lloFIL ) < g{e 1 ^') 
and 

min g(e' DFE ) < min g(e lun ). 

j=l,...,\V\ j=l,...,\V\ 



VI. Simulation Studies 

In this section, we simulate the performance of the pro- 
posed limited feedback MIMO schemes over a standard i.i.d. 
Rayleigh block fading channel model. 1 For the error rate 
performance comparisons, we use 16-QAM signaling and we 
plot the average bit error rate (BER) of the K data streams 
against the signal-to-noise-ratio, which is defined as the ratio 
of the total average transmitted power P tota ] to the total receiver 
noise power E{n ff n}. We compare the performance of the 
proposed codebook designs for systems with zero-forcing DFE 
with that of the optimal zero-forcing DFE transceiver for 
the case of perfect CSI that was presented in Section III. 
For the proposed limited-feedback schemes, the Grassmann 
codebooks are constructed using the design approach in [29]; 
see also [14]. (Grassmann codebooks could also be constructed 
using the optimization algorithms in [26], [27]). We also 
provide simulation-based comparisons with the two limited 
feedback schemes for zero-forcing DFE systems in [18]. In 
addition, we provide performance comparisons with limited 
feedback schemes for linear zero-forcing transceivers that use 
Grassmann codebooks [14], and with the optimal zero-forcing 
linear transceiver designs for the case of perfect CSI for 

'The coefficients of the channel matrix H are modelled as independent 
circularly symmetric complex Gaussian random variables with zero mean and 
unit variance. 
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SNRin dB 

Fig. 2. BER performance of various MIMO transmission schemes with 
zero-forcing DFE for a system with Nt = 6, N r = 3, and K = 3 
simultaneously transmitted 16-QAM data streams. The schemes considered 
are: the proposed codebook designs for the objectives of minimization of the 
sum of MSEs (Grassmann-6 bits- Sum MSE), minimization of the average 
BER (Grassmann-6 bits- Min BER); the optimal zero-forcing design for any 
Schur-convex design objective with perfect CSI (ZF DFE - Perfect CSI); and 
the limited feedback schemes in [18], which are based on feeding back the 
detection ordering (Ordering Feedback - ZF DFE) and (Ordering Feedback2 - 
ZF DFE). The lower curve for each method represents the BER performance 
obtained under the assumption of correct previous decisions. 



minimum MSE and minimum bit error rate design criteria 
[30]. 

A. Comparisons with Limited Feedback Zero-forcing DFE 
Schemes 

In Fig 2, we consider a MIMO system with N t = 6 transmit 
antennas and N r = 3 receive antennas that transmits K = 3 
independent data streams. We compare the performance of 
the proposed schemes with Grassmann codebook designs and 
precoder selection based on the minimization of the sum of 
the MSEs (Grassmann-6 bits- Sum MSE), minimization of 
the average BER (Grassmann-6 bits- Min BER), and the mini- 
mization of the maximum MSE (Grassmann-6 bits- Max MSE) 
which is equivalent to the maximization of minimum SINR. 
The codebooks consist of 64 unitary matrices, and hence 6 bits 
of feedback are used per block. We also make comparisons 
with the limited feedback schemes in [18] (Ordering Feedback 
ZF-DFE and Ordering Feedback2 ZF-DFE) in which the 
receiver feeds back the index of the selected permutation of the 
columns of H from the set of possible = N t \/(N t — A')! 
permutation matrices. For the system under consideration, the 
number of possible permutations matrices is 120, almost twice 
the size of the Grassmann codebook. In the scheme denoted 
Ordering Feedback ZF-DFE the permutation matrix is selected 
based on the norms of the columns of H, while the scheme 
denoted Ordering Feedback2 ZF-DFE the permutation is se- 
lected based on a greedy ordering of the QR decomposition 
of the channel matrix H. In Fig. 2, we observe the close 
performance of the proposed codebooks with different Schur- 
convex selection criteria. This is to be expected, because in the 




5 10 15 20 25 

SNRindB 



Fig. 3. BER performance of various MIMO transmission schemes with 
zero-forcing DFE for a system with Nt = 5, N r = 4, and K = 4 
simultaneously transmitted 16-QAM data streams. The schemes considered 
are: the proposed codebook designs for the objectives of minimization of the 
sum of MSEs (Grassmann-6 bits- Sum MSE), minimization of the average 
BER (Grassmann-6 bits- Min BER); the optimal zero-forcing design for any 
Schur-convex design objective with perfect CSI (ZF DFE - Perfect CSI); and 
the limited feedback schemes in [18], which are based on feeding back the 
detection ordering (Ordering Feedback - ZF DFE) and (Ordering Feedback2 
- ZF DFE). 




2 1 1 1 1 1 1 1 1 1 

-2 2 4 6 8 10 12 14 

SNRindB 

Fig. 4. Average of Gaussian mutual information in (22) for various MIMO 
transmission schemes with zero-forcing DFE for a system with Nt = 5, N r = 
4, and K = 4. The schemes considered are: the proposed codebook designs 
for Gaussian mutual information objective (Grassmann-6 bits- Mutual info); 
the optimal zero-forcing design for any Schur-convex design objective with 
perfect CSI (ZF DFE - Perfect CSI); and the limited feedback schemes in [18], 
which are based on feeding back the detection ordering (Ordering Feedback 
- ZF DFE) and (Ordering Feedback2 - ZF DFE). 
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limit of infinite feedback (i.e., perfect CSI), all these objectives 
result in the same optimal precoder design. We also observe 
that the Grassmann codebooks provide significantly better 
performance than the schemes that are based on precoding 
with permutation matrices, even though they employ fewer 
feedback bits. This is because codebooks constructed from 
permutation matrices are special cases of those constructed 
from unitary matrices, and they do not necessarily minimize 
the distortion measures. Note that for all error performance 
figures in this paper, the simulation results of all ZF-DFE 
methods include the effect of error propagation. For reference, 
in Fig 2 we also provide the performance under the assumption 
of correct previous decisions; i.e., no error propagation. We 
observe that at high SNRs, the practical performance of the 
optimal zero-forcing DFE transceiver for the case of perfect 
CSI and the proposed designs based on Grassmann codebooks 
are close to their corresponding performance in absence of 
error propagation. This also holds for the permutation feedback 
scheme (Ordering FeedbackZ ZF-DFE). 

Analogous performance advantages to those in Fig 2 are 
observed in Fig 3, which shows the performance for a MIMO 
system with N t = 5 transmit antennas and N r = 4 receive 
antennas that transmits K = 4 data streams. The size of each 
permutation-based codebook is 120 matrices, while the size of 
each Grassmann codebook is 64 matrices. 

In Fig 4 we compare several different methods in terms 
of the Gaussian mutual information that they achieve. We 
consider a system with Nt = 5, TV,. = 4, and K = 4, 
and we plot the average, over 1000 channel realizations, of 
the Gaussian mutual information achieved by the ZF-DFE 
transceiver with the quantized precoder; i.e., the average of 
the values of (22) achieved by the quantized precoder. For 
the proposed scheme we consider a Grassmann codebook 
design and precoder selection based on the maximization of 
the Gaussian mutual information (Grassmann-6 bits- Mutual 
info.), and a codebook that consists of 64 unitary matrices. 
We make comparisons with the limited feedback schemes in 
[18] (Ordering Feedback ZF-DFE and Ordering FeedbackZ 
ZF-DFE), whose permutation-based codebooks contain 120 
matrices. We observe that the proposed Grassmann codebook 
with precoder selection based on the maximization of the 
Gaussian mutual information provides the closest performance 
to the optimal ZF-DFE design for the case of perfect CSI, 
which was presented in Section III. 

B. Comparisons with Limited Feedback Linear Zero-forcing 
Schemes 

In Fig 5, we consider a MIMO system with N t = 5 transmit 
antennas and N r = 4 receive antennas that transmits K = 4 
independent data streams. We compare the performance of 
the proposed ZF-DFE schemes that use Grassmann codebooks 
with that of the corresponding linear zero-forcing schemes that 
use Grassmann codebooks with the same feedback rate [14]. 
We consider linear limited feedback schemes with different 
precoder selection criteria, namely minimization of the total 
MSE (LinZF-Grassmann-6 bits Sum MSE), and maximization 
of the minimum eigen value of the overall channel HP 




-e — ZF-DFE S-Convex Perfect CSI 

Grassmann-6 bits- Min BER 

■ Grassmann-6 bits- Max MSE 

■ + - Grassmann-6 bits- Sum MSE 

■ B - LinZF Min-MSE Perfect CSI 

■ $ - LinZF Min-BER Perfect CSI 

- - LinZF-Grassmann-6 bits Max MSE 

■ * - LinZF-Grassmann-6 bits Sum MSE 



Fig. 5. BER performance of various MIMO transmission schemes with 
zero-forcing linear and DFE systems with Nt = 5,-/V r = 4, and K = 4 
simultaneously transmitted 16-QAM data streams. The schemes considered 
are: the proposed codebook designs for the objectives of minimization of the 
sum of MSEs (Grassmann-6 bits- Sum MSE), minimization of the average 
BER (Grassmann-6 bits- Min BER); the optimal zero-forcing design for any 
Schur-convex design objective with perfect CSI (ZF DFE - Perfect CSI); 
the optimal linear zero-forcing design for minimum MSE (LinZF Min-MSE 
Perfect CSI) and minimum average BER (LinZF Min-BER Perfect CSI) [30]; 
and the linear zero-forcing limited feedback schemes in [14] for minimum 
total MSE (LinZF-Grassmann-6 bits Sum MSE) and minimum maximum 
MSE (LinZF-Grassmann-6 bits Max MSE). 




- ZF-DFE S-Convex Perfect CSI 

- Grassmann-4 bits- Max MSE 

- Grassmann-4 bits- Sum MSE 

- LinZF Min-MSE Perfect CSI 

- LinZF Min-BER Perfect CSI 
LinZF-Grassmann-4 bits Max MSE 

- LinZF-Grassmann-4 bits Sum MSE 
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Fig. 6. BER performance of various MIMO transmission schemes with 
zero-forcing linear and DFE systems with Nt = 4,7V r = 3, and K = 3 
simultaneously transmitted 16-QAM data streams. The schemes considered 
are: the proposed codebook designs for the objectives of minimization of the 
sum of MSEs (Grassmann-6 bits- Sum MSE), minimization of the average 
BER (Grassmann-6 bits- Min BER); the optimal zero-forcing design for any 
Schur-convex design objective with perfect CSI (ZF DFE - Perfect CSI); 
the optimal linear zero-forcing design for minimum MSE (LinZF Min-MSE 
Perfect CSI) and minimum average BER (LinZF Min-BER Perfect CSI) [30]; 
and the linear zero-forcing limited feedback schemes in [14] for minimum 
total MSE (LinZF-Grassmann-6 bits Sum MSE) and minimum maximum 
MSE (LinZF-Grassmann-6 bits Max MSE). 



9 



(LinZF-Grassmann-6 bits Max MSE), which corresponds to 
minimization of the maximum MSE [14]. We also provide per- 
formance comparisons with the zero-forcing DFE transceiver 
design for perfect CSI that simultaneously optimizes any 
Schur-Convex design criteria, and with the corresponding 
optimal zero-forcing linear transceiver designs for perfect CSI 
that minimize the total MSE or the average BER. Unlike the 
DFE case, these two design criteria result in different precoder 
designs [30]. In Fig. 5, we observe that the proposed zero- 
forcing DFE systems with limited feedback perform better 
than the corresponding linear schemes; as is to be expected, 
c.f. Lemma 3. Similar performance advantages are observed in 
Fig 6 for a MIMO system with Nt = 4 transmit antennas and 
N r = 3 receive antennas that transmits K = 3 independent 
data streams. 



VII. Conclusion 

We have considered the design of multiple-input multiple- 
output communication systems with zero-forcing decision 
feedback equalization (DFE) when only limited rate feedback 
from the receiver to the transmitter is available. We considered 
schemes in which the receiver uses its CSI to select the 
best available precoder from a codebook of precoders and 
then feeds back the index of this precoder to the transmitter 
using a small number of bits. To facilitate the development of 
the limited feedback scheme, we developed a unified design 
framework for the joint design of the precoder and DFE 
receiver when perfect channel state information is available. 
We then characterized the statistical distribution of the optimal 
precoder in a standard Rayleigh fading environment, and 
showed that codebooks constructed from Grassmann packings 
minimize an upper bound on an average distortion measure. 
Our simulation studies showed that the proposed limited 
feedback scheme can provide significantly better performance 
with a lower feedback rate than the existing schemes in which 
the detection order is fed back to the transmitter. 



Appendix 

A. Optimal Precoder for Schur-convex Functions 

If g(e l ) is a Schur convex function of I, then from Lemma 1 



we have that 



g(e T ) < g(e l ), 



(44) 



and the optimal value is obtained when all lj are equal to 



U = ^lndet(N). 
iv 



(45) 



Hence, all MSEs are equal to E ri = L% = ^/det(N). Since 
the objective is an increasing function of the individual MSEs, 
the design goal reduces to minimizing detN subject to the 
power constraint on the precoder and to the constraint that 
diagonal elements of the Cholesky factor of N are all equal. 
We will start by characterizing the family of precoders that 
minimize det(N) subject to the power constraint, then we 
will show that there is a member of this family that yields 



a Cholesky factor of N with equal diagonal elements. Mini- 
mizing det(N) is equivalent to maximizing dct(P ff H fl HP), 
and the family of optimal precoders is given by [31]: 



^UiV, 



(46) 



where Ui £ ^N t xK con t ams th e eigen vectors of H H H cor- 
responding to the K largest eigen values, and V £ 



-<KxK 



IS 



a unitary matrix degree of freedom. To complete the design of 
P, we need to select V such that the Cholesky decomposition 
of N = LL ff yields an L factor with equal diagonal elements. 
Using (46) we have that 



N = 



-1/2., 



M f H 1/2 
Ptotal V H1 

LL ff = R ff R = (QR) H (QR), 



HI 



(47) 



where Ahi is the diagonal matrix containing the largest K 
eigen values of H H H, and Q is a matrix with orthonormal 
columns. Therefore, finding V is equivalent to finding a V 
such that QR decomposition of (A H y 2 V) has an R-factor 
with equal diagonal. This problem was solved in [24], [25], 
and V can be obtained by applying the algorithms therein to 



the matrix A 



-1/2 
HI 



B. Optimal Precoder for Schur-concave Functions 

If g(e l ) is a Schur-concave function of Z, then from 
Lemma 1 we have that ^(e') is minimized when L|j = A;(N), 
and that this equality holds when L is normal matrix. Since L 
is a lower triangular matrix, in order for it to be normal it must 
be a diagonal matrix [23]. The optimal C in that case is I, and 
hence B = 0. That is, in the case of Schur-concave functions 
of I, the optimal ZF-DFE design results in zero-forcing linear 
equalization. 
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